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Abstract 

An out-tree T of a directed graph D is a. rooted tree subgraph with all arcs directed 
outwards from the root. An out-branching is a spanning out-tree. By i{D) and £s{D) 
we denote the maximum number of leaves over all out-trees and out-branchings of D, 
respectively. 

We give fixed parameter tractable algorithms for deciding whether £s{D) > k and 
whether i{D) > k for a digraph D on n vertices, both with time complexity 2'^('^''°s'=) . 
7jO(i)_ This improves on previous algorithms with complexity 2'^^'"' ■ n'^^^^ and 

20(fciog=fc) .„o(i)^ respectively. 

To obtain the complexity bound in the case of out-branchings, we prove that when all 
arcs of D are part of at least one out-branching, is{D) > £{D)/3. The second bound we 
prove in this paper states that for strongly connected digraphs D with minimum in-degree 
3, £s{D) > &{y/n), where previously is{D) > 9(^/n) was the best known bound. This 
bound is tight, and also holds for the larger class of digraphs with minimum in-degree 3 
in which every arc is part of at least one out-branching. 



1 Introduction 

Many important graph problems are well-studied on undirected graphs unlike their general- 
izations to directed graphs. One reason may be that despite their practical significance, it is 
generally harder to obtain similar results for directed graphs. Max-Leaf Spanning Tree 
is such a problem that has received a lot of study, both algorithmically and combinatorial. 
This optimization problem is defined as follows: given an undirected graph, find a spanning 
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tree with maximum number of leaves. In the decision version of this problem, in addition an 
integer k is given, and the question is whether a spanning tree with at least k leaves exists 
(A;-Leaf Spanning Tree). These problems are motivated by many practical and theoreti- 
cal applications (see for instance OUS]), though in some cases, the applications call for a 
directed generalization [B], which is what we study in this paper. 

For directed graphs or digraphs we use notions that are defined for undirected graphs, 
such as paths, trees, connectedness and vertex neighborhoods. These are defined as expected, 
where arc directions are ignored. An out-tree of a digraph D is a tree subgraph where every 
vertex has in-degree 1 except for one, the root, which has in-degree 0. An out-branching is 
a spanning out-tree. A leaf is a vertex with out-degree 0. In the directed generalization of 
the problem, one asks for an out-branching with maximum number of leaves. This problem 
is called Max-Leaf Out-Branching. By £{D) and we denote the maximum number 

of leaves over all out-trees and out-branchings of D respectively (when considering is{D) we 
assume that D has at least one out-branching). Clearly i{D) > is{D) holds, but in contrast to 
undirected graphs, we do not always have equality here. In fact the ratio i{D) / £s{D) can be 
arbitrarily large. Therefore, on digraphs, the problem of finding an out-tree with maximum 
number of leaves (Max-Leaf Out-Tree) is of independent interest. The corresponding de- 
cision problems where the question is asked whether is(D) > k or whether £{D) > k are called 
fc-LEAF Out-Branghing and fc-LEAF Out-Tree, respectively. The related problem of find- 
ing out-branchings with minimum number of leaves has also been considered recently |15j . 
In the first part of this paper we are concerned with algorithmic questions, and in the sec- 
ond part we study the combinatorial question of finding lower bounds for i{D) and ls{D). 
Throughout this section n denotes the number of vertices of the graph under consideration. 

The A^P-hardness of all problems above follows from the A^P-completeness of /c-Leaf 
Spanning Tree. Whereas for the undirected problem, Max-Leaf Spanning Tree, a 
2-approximation is known [18], the best known approximation result for Max-Leaf Out- 
Branghing is a very recent algorithm with ratio 0{y/n) [TT]. In the algorithmic part of 
this work, we are interested in fixed parameter tractable (FPT) algorithms for the decision 
problems. We choose the desired number of leaves k as the parameter. Then an algorithm 
is an FPT algorithm if its time complexity is bounded by a function of the form f{k) ■ n^^^\ 
where the parameter function f may be any computable function only depending on k. FPT 
algorithms are well-studied and classified. The book of Downey and Fellows [10] provides 
an introduction into parameterized complexity. See the books of Flum and Grohe [13] and 
Niedermeier [17] for more recent introductions into parameterized complexity. The main 
indicator of the practicality of FPT algorithms is the growth rate of the parameter function, 
and which is one important reason to design FPT algorithms with small parameter function. 
For the undirected problem /c-Leaf Spanning Tree many improvements have been made 
in this area (see e.g. |12t [5]). which has also has been a large stimulus for research on related 
combinatorial questions. The current fastest FPT algorithm has a running time of O* (6.75*^) 
0{m), with m being the number of edges [7j. 

Considering the directed versions of the problem, Alon et al [2] were the first to give an 
FPT algorithm for /c-Leaf Out-Tree, which had a running time 2'^(*^ • n'^^^\ In [l] 

they improved this to 2'-'^^^°^ ■ n^^^\ They also observed that for digraph classes where 
out-trees with k leaves can always be extended to out-branchings with k leaves, this solves k- 
Leaf Out-Branghing with the same time complexity, and that this property holds for the 
important classes of strongly connected digraphs and acyclic digraphs. For acyclic digraphs. 
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they also gave a specialized algorithm for /c-Leaf Out-Branching with a complexity of 
20(fciogfc) . ^o(i) p_ Q^iy 

very recently, the question whether an FPT algorithm exists for 
fc-LEAF Out-Branching for all digraphs has been resolved, by giving an algorithm with 
complexity 2'^('=' • n'^^^) [6]. 

In this paper we present FPT algorithms for both A;-Leaf Out-Tree and /c-Leaf Out- 
Branching with parameter function 2'^^^^"^^\ This improves the complexity of all FPT 
algorithms for digraphs mentioned above, except for the algorithm for acyclic digraphs, which 
has the same complexity. 

In another line of research, max-leaf problems have been studied in a purely combinatorial 
manner. For instance, for the undirected version, a well-known (tight) bound states that 
undirected graphs with minimum degree 3 have a spanning tree with at least n/4+2 leaves |16j . 
Similar bounds appear in [71 [9]. For digraphs, it is much harder to obtain tight bounds, or even 
bounds that are tight up to a constant factor. Alon et al [l] showed that for strongly connected 
digraphs D with minimum in-degree 3, ls{D) > 1 (this improves their previous bound 

from [2]). In addition they construct strongly connected digraphs D with minimum in-degree 
3 with is{D) = 0{^/n). Considering the gap between this lower bound and upper bound, it 
is asked in [1] what the minimum value of r is such that is{D) > f{n) S G(\/n) for all graphs 
in this class (2 < r < 3). 

In this paper we answer this question by showing that for strongly connected digraphs D 
with minimum in-degree 3, is{D) > \\/n. Considering the examples from [1], we see that this 
bound is tight (up to a constant factor). Furthermore we generalize this result by showing 
that is{D) > fi'iT') £ Q{\/n) holds for the larger class of digraphs with minimum in-degree 3 
without useless arcs. Useless arcs are arcs that are not part of any out-branching. 

Overview of new techniques The analysis that is required to prove the correctness of 
our improved algorithms is entirely new. However, the algorithms themselves are similar to 
those from [1], and in particular to the one given in [6j. Therefore we give a short overview 
of the techniques used to obtain the previous FPT algorithms for /c-Leaf Out-Tree and 
A;-Leaf Out-Branching, and then give an overview of the new ideas and techniques in this 
paper. 

In [6], it is first observed that useless arcs may be deleted from the digraph. For the 
resulting digraph D, in [6| a variant of the algorithms introduced in [2] and [T] is used: 
starting with an arbitrary out-branching, small changes are made that increase the number 
of leaves, until a locally optimal out-branching T is obtained. Back arcs of T are those arcs 
of D that form a directed cycle together with a part of T. If at every point in T (we omit 
the precise definition used in [6]) there are at most 6k'^ back arcs, then a path decomposition 
of D is constructed with width w < 6k^, which allows for a dynamic programming procedure 
with complexity 2^^'^^°^'^^ • n to be used. On the other hand, if the number of back arcs is at 
least 6k^ at some point, it is shown that an out-branching with at least k leaves exists. This 
last proof makes heavy use of the fact that no useless arcs are present. 

In this paper, we construct a tree decomposition instead of a path decomposition (the 
locally optimal out-branching that we start with actually serves as the skeleton for the tree 
decomposition), and use a better way to group back arcs. These two simple improvements do 
not only make the algorithm conceptually simpler, but also allow us to decrease the parameter 
function 2'^^'^'°^ for A;-Leaf Out-Tree from [1] by a logarithmical factor in the exponent 
to 20(^i°s'=). 
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Our main technical contribution of this paper consists of two combinatorial bounds. The 
first of these bounds allows us to obtain a parameter function of 2^^^^°^^^ also for /c-Leaf 
Out-Branching. For out-branchings, our research is motivated by the following question: 
for digraphs without useless arcs, what is the highest possible ratio i{D)/is{D)'! Figured] 
shows an example of a digraph without useless arcs where i{D)/is{D) = 2. In the first of the 
two main bounds of this paper, we prove that this ratio cannot be much larger; we prove that 
if D contains no useless arcs, then i{D)/is{D) < 3. Since this ratio is bounded by a constant, 
an algorithm for fe-LEAF Out-Branching with the same complexity is then easily obtained. 




Figure 1: A digraph without useless arcs with i{D) = n — 2 (use r as root) and is{D) = 
(n — 2)/2 (r' has to be the root). 

To prove our second bound, the lower bound on is{D) in strongly connected digraphs with 
minimum in-degree 3, we start with the method introduced in [T]: in [T] a locally optimal 
out-branching T is considered. It is shown that if T contains a path of length at least 2k'^ 
that contains only vertices that have out-degree 1 in T, an out-branching with at least k 
leaves can be found. If such a path does not exist, and T itself also has less than k leaves, the 
upper bound n < Ak^ follows. In this paper we use the same general idea, but using a more 
sophisticated method to construct out-branchings, we can already find an out-branching with 
at least k leaves if the aforementioned path has length 8k. 

The paper is organized as follows. Definitions and preliminary observations are given in 
Section [21 In Section [3] the FPT algorithm for /c-Leaf Out-Tree is given, and in Section H] 
the FPT algorithm for A;-Leaf Out-Branghing is given. Section |4] also contains the proof 
that i{D)/£s{D) < 3 for digraphs without useless arcs. In Section[5]we prove the lower bound 
for IsiD). 

2 Preliminaries 

General definitions For basic graph theoretic definitions see [8], and for directed graphs 
in particular see [3]. We reuse many of the definitions and observations from [6] in this paper, 
so parts of this preliminaries section are taken literally from [6]. For a digraph D, V{D) 
denotes the set of vertices and A{D) the set of arcs. Arcs are 2-tuples (u, v) where u £ V{D) 
is called the tail and v G V{D) the head. For an arc set B, HEAD(i?) is the set of heads of 
arcs in B. A digraph D is an oriented graph if {u,v) € A{D) implies {v,u) A{D). A dipath 
in a digraph D is a sequence of distinct vertices vi,V2, ■ ■ ■ ,Vr such that {vi,Vi+i) £ A{D) for 
all 1 < i < r — 1. This will also be called a {vi,Vr)- dipath. The digraph consisting of these 
vertices and arcs will also be called a dipath. With such a dipath we associate an order from 
vi to Vr, for instance when talking about the first arc of the path that satisfies some property. 

A partial order is a binary relation that is reflexive, antisymmetric and transitive. A strict 
partial order is irreflexive and transitive. Partial orders will be denoted by ^, and strict 
partial orders by -<. 

For digraphs we will use normal (undirected) tree decompositions. Hence we define a tree 
decomposition of a digraph D as a pair {X, U) where U is an (undirected) tree whose vertices 



4 



we will call nodes, and X = {{Xi : i S V{U)}) is a collection of subsets of V{D) (bags) such 
that 

1. U^eViU)X^ = V{D), 

2. for each arc {v,w) G A{D), there exists an i £ y{U) such that v,w £ Xi, and 

3. for each v G V{D), the set of nodes {i : v £ Xi} forms a subtree of U. 

The width of a tree decomposition {{Xi : i G !/([/)},[/) equals maXjgy((7){|Xj| — 1}. For 
notational convenience, we will also allow the graph [/ in a tree decomposition {X, U) to 
be directed, in this case it should be understood that we actually consider the underlying 
undirected graph of U . 

Definitions for out-trees and out-branchings A subtree T of a digraph D is an out- 
tree if it has only one vertex of in-degree zero, its root. If T is a spanning out-tree of D, i.e. 
V{T) = V{D), then we call T an out-branching of D. The vertices of T of out-degree zero 
are leaves and the vertices of out-degree at least two are called branch vertices. Let Leaf(T) 
denote the set of leaves of T, let Branch(T) denote the set of branch vertices of T, and let 
BrSucc(T) be the vertices of T that have a branch vertex of T as in-neighbor. Note that 
Leaf(T) n BRSucc(r) may not be empty. 

Proposition 1 Let T be an out-tree. T/ien |BRSucc(r)| < 2|Leaf(T)|-2, and |BRANCH(r)| < 
|LEAF(r)| - 1. 

The omitted proofs in this section are straightforward and /or can be found in [TJ [6] . If there 
exists a dipath in D from vertex u to vertex v, we say v is reachable from u (within D). The 
set of all vertices that are reachable from u within D is denoted by Ro{u). (This set includes 
u itself.) 

Proposition 2 Let T be an out-tree of a digraph D, with root r. Then D has an out-branching 
T' with root r, that contains T, if and only if Ro{r) = V{D). 

Let T be an out-tree. Then we write u :<t v ii v £ Rt{u), and u -<t v if in addition v ^ u. 
The following important observation will be used implicitly throughout the paper. 

Proposition 3 Let T be an out-tree. The relation :<t is a partial order on V{T). 

A digraph H is strongly connected if for all pairs u,v £ V{H), a (n, z;)-dipath exists. A 
strong component is a maximal strongly connected subgraph. A strong component H oi D is 
an initial strong component if there is no arc {u,v) £ A{D) with u V{H), v £ V{H). Note 
that all initial strong components can be found in polynomial time. 

Let T be an out-branching of D, and let {u,v) £ A{D)\A{T), where v is not the root of 
T. The 1-change for {u,v) is the operation that yields T + {u,v) — {w,v), where w is the 
unique in-neighbor of v in T. We call an out-branching T 1-optimal if there is no 1-change 
for an arc of A{D)\A{T) that results in an out-branching T' with more leaves. Note that a 
1-optimal out-branching can be found in polynomial time. 

Proposition 4 Let T be an out-branching of D, and let {u,v) £ A{D)\A{T). The 1-change 
for {u,v) gives again an out-branching of D if and only if v u. 
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Proposition 5 Let T he an out-branching of D, and let {u,v) £ A{D)\A(T). The 1-change 
for (u, v) increases the number of leaves if and only if u ^ Leaf(T) and v BrSucc(T). 

An arc {u,v) of a digraph D is useless if D has no out-branching containing {u,v). 

Proposition 6 Let D be a digraph with a vertex r such that Roir) = V{D). An arc {u,v) 
of D with Rd{v) 7^ y{L^) is not useless if and only if there is a dipath in D starting at r that 
ends with (n, v) . 

Note that useless arcs can be removed in quadratic time. 



3 A Faster FPT Algorithm for A;-Leaf Out-Tree 

We now show how back arcs of an out-tree are grouped, that is, how back arcs are assigned 
to vertices of the out-tree. Let T be an out-tree of D with z £ V{T). Then 

BACxj^iz) = {(n, v) e A{D) : V -<T z u}. 

If it is clear what the graphs D and T in question are, the subscript and superscript will be 
omitted. When |Head(Back(2:))| > k for some choice of z, an out-tree with at least k leaves 
is easily found. 

Proposition 7 Let T be an out-tree of D with |Head(Back|^(2;))| > k for some z £ V{T). 
Then D has an out-tree with at least k leaves. 

Proof: Start with the out-tree r[i?7^(z)], which is rooted at z. For every vertex in v £ 
Head(Back^(z)), add an arc from some vertex in u G Rt{z) to v (such an arc exists), 
making v a leaf. □ 

This yields the correctness of Step [1] of the algorithm, which is shown in Algorithm 13.11 



Algorithm 3.1: An FPT algorithm for fc-LEAF Out-Tree. 
Input : A digraph D and integer k. 

for every initial strong component C of D do 
Choose r G V{C), let D' = D[RD{r)]. 
Compute a 1-optimal out-branching T of D' with root r. 
if |Leaf(T)| > k then Return(YES). 

if there exists a vertex z with |Head(Back'^/(2:))| > k then 
Return(YES). 

Construct a tree decomposition of D' with width at most 4A; — 5. 
Do dynamic programming on the tree decomposition of D'. 
if an out-tree with at least k leaves is found then Return(YES). 

8 Return(NO) 



The construction of the tree decomposition of D' is as follows. For the tree of the tree 
decomposition, we simply use the 1-optimal out-branching T itself. For a vertex v £ V{T) 
with {u,v) £ A{T), the bag of the tree decomposition is defined as follows. 

Xy = {u,v} U BRSucc(r) U LEAF(r) U Head(Back|^/(?;)). 
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(If V is the root of T, simply omit u.) The tree decomposition is now {X, T), with X = {X^ 
V G V{T)}. 



Lemma 8 If T is a 1-optimal out-branching of D' , then {X,T) as constructed above is a tree 
decomposition of D' . 

Proof: Every vertex v £ V{D') is included in at least one bag, namely Xy. Now we show 
that for every arc {u, v) £ A{D') there is a bag containing both u and v. If one of its end 
vertices, say v, is in BrSucc(T) or in Leaf(T), then u,v £ X^- If {u,v) G ^{T)-, then 
u,v £ Xy. Otherwise, since T is 1-optimal, we have w.l.o.g. v -<t u (Proposition [U [5]), and 
then we have v £ HEAD(BACK(n)), so u,v £ Xu- 

We now verify the third condition, namely that the vertex set = {u : v £ Xu} induces 
a connected subgraph of T, for every v £ V{T). li v £ Leaf(T) or f S BrSucc(T), then 
Sv = V(T), so the property obviously holds. So now assume v Leaf(T) U BrSucc(T). 
Suppose V £ Xu for some u ^ v, so v £ HEAD(BACK(ti)) or {v,u) £ A{T). It then follows 
by the definition of JiACK(w) and the transitivity of :<t that for every w with v <t w <t u, 
V £ Xu, holds. So r[;St,] contains a path from v to u. This holds for every u with v £ Xu, so 
this subgraph of T is connected. □ 



Proposition 9 Let T be an out-branching of a digraph D with |Leaf(T)| < k — 1. If for 
all vertices z £ V{D) it holds that |Head(Back-^(z))| < k — 1, then the tree decomposition 
{X,T) as constructed above has width at most 4/e — 5. 

Proof: This follows simply from 

\Xu\ = 2 + |LEAF(r)| + |BRSucc(r)| + |Head(Back(u))| < 

2 + (/c - 1) + (2A: - 4) + (A: - 1) = 4A; - 4, 

since |BRSucc(r)| < 2|LEAF(r)| - 2 (Proposition [ID . □ 

When a tree decomposition is given of D', standard dynamic programming methods can 
be used to decide whether D' has an out-tree with at least k leaves (see also [H US]). The 
time complexity of such a procedure is 2*-^("''°s"') • n, where n = |y(L'')| and w is the width 
of the tree decomposition. 

Theorem 10 For any digraph D with n = \V{D)\, Algorithm \3.1\ solves /c-Leaf Out-Tree 
in time 2^^^^°^^'' ■ n^^^^ 

Proof: Lemma [8] shows that the tuple {X,T) we construct is indeed a tree decomposition. 

We now prove that Algorithm 13.11 returns the correct answer in every case. Step [T] and [T] 
are clearly correct. Step [1] is correct by Proposition [71 

To prove the correctness of Step [H suppose an out-tree T with at least k leaves exists in 
D. There is an initial strong component C of D such that T is part of -D[i?/)(r)] for any vertex 
r £ V{C). In the iteration of the algorithm where C is considered, the dynamic programming 
procedure of the tree decomposition will therefore return YES, if it is not returned before in 
Step [T] or [TJ So Step [1] only returns NO when no out-tree with at least k leaves exists. 

Finally we consider the time complexity of Algorithm 13.11 It is easy to see that every 
step of the algorithm can be done in time polynomial in n, except Step [H which takes time 
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20{fciogfc) . since the width of the tree decomposition is at most 4A: — 5. (Proposition [9|) . 
Steps[THI]are repeated at most n times (for every possible choice of initial strong component), 
so in total the complexity becomes 2*^(^'°s^') • n^W _ □ 

Note that Algorithm 13.11 can be made into a constructive FPT algorithm. 

4 A Faster FPT Algorithm for /c-Leaf Out-Branching 

We can modify the previous algorithm in order to solve /c-Leaf Out-Branching, see Algo- 
rithm 14.11 

Algorithm 4.1: An FPT algorithm for fc-LEAF Out-Branching. 
Input : A digraph D and integer k. 

1 if D has no out-branching then Return(NO). 

2 Remove from D all useless arcs to obtain D'. 

3 Compute a 1-optimal out-branching T oi D' . 

4 if |LEAF(r)| > k then Return(YES). 

5 if T has a vertex z such that |Head(Back^,(z))| > 3A: then 

Return(YES). 

6 Construct a tree decomposition of D' with width at most 6A; — 5. 

7 Do dynamic programming on the tree decomposition of D' . 

8 if an out-branching with at least k leaves is found then Return(YES). 

9 Return(NO) 



The main new bound that we use to prove the correctness of algorithm 14. II is proved later 
in Section 14. 1[ There it is shown that if a digraph without useless arcs has an out-tree with at 
least 3k leaves, this can be used to construct an out-branching with at least k leaves. The tree 
decomposition used in the algorithm is exactly the same as the one constructed in Section [3l 
Since |Head(Back(z))| may now be at most 3A; — 1, the width is at most 6k — 5. 

Proposition 11 LetT be a 1-optimal out-branching of a digraph D with |Leaf(T)| < k—1. If 
for all vertices z G V{D) it holds that |Head(Back]^(2;))| < 3A; — 1, then a tree decomposition 
{X, T) of D with width at most 6k — 5 can be constructed. 

We now prove the correctness of Algorithm 14.11 and analyze its time complexity. 

Theorem 12 For any digraph D with n = \V{D)\, Algorithm \4-l\ solves /c-Leaf Out- 
Branching in time 2^^^^°^^^ ■ nP^^\ 

Proof: We first prove that Algorithm 14 . 1 1 returns the correct answer in every case. Step [21 [2] 
and[2]are obviously correct. If |Head(Back(2;))| > 2>k for some z, an out-tree with at least 3/c 
leaves exists (Proposition [7]), which in turn yields an out-branching with at least k leaves since 
D' contains no useless arcs (Theorem 1 13p . This shows Step [2] is correct. If an out-branching 
of D with at least k leaves exists, then this is also an out-branching of D', so in this case YES 
will be returned in Step [21 if not before. This proves the correctness of Step [21 

Finally we consider the time complexity of Algorithm 14. 1[ Every step of the algorithm can 
be done in time polynomial in n, except Step[Tl which takes time 2^^^^°^^^ - n, since the width 
of the tree decomposition is bounded by 6A; — 5 (Proposition llip . In total the complexity 
becomes 2'^^^ • n'^(^) . □ 
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4.1 Constructing Leafy Out-Branchings from Out-trees 



In this section we prove one of the two main bounds of this paper, which yields the correctness 
of Step [2] of Algorithm 14.11 The proof of Theorem [13] can be turned into a polynomial time 
algorithm that constructs an out-branching, and therefore Algorithm 14.11 can be made into a 
constructive FPT algorithm. 

Theorem 13 Let D be a digraph without useless arcs. If i{D) > 3k, then is{D) > k. 

Proof: Let T be an out-tree of D with at least 3A; leaves, and let r be the root of T. If T 
contains at least one vertex v with Roiv) = V{D), then also Rni^) = ^(D), so then T can 
be extended to an out-branching with at least 3k leaves (Proposition [2]). 




Figure 2: (a) Out-tree T and (r', r)-dipath P, and (b) the out-tree T' constructed in Case 1. 

Otherwise, choose an arbitrary vertex r' with R£){r') = V{D) (which exists since there 
are non-useless arcs, and thus at least one out-branching), and let P be an (r', r)-dipath 
that contains a minimal number of vertices of Leaf(T). Let Leaf(T) n V{P) = {/i, . . . , Im}, 
labeled with decreasing labels along P. That is, if i < j, then Ij -<p /j. These definitions are 
illustrated in Figure [2] (a). We distinguish two types of vertices k (i E {1, . . . 

1. Type 1: D — U contains an (x,y)-dipath for some x,y £ V{P) with x ~<p k ~<p y, with 
no internal vertices in V{P). 

2. Type 2: all other vertices li. 

Now we consider three cases: since | Leaf (T)| > Sfc, one of the following holds: (i) |LEAF(T)\y(P)| > 
k, (ii) the number of type 1 leaves is at least k, or (iii) the number of type 2 leaves is at least 
k. In all cases we will find an out-branching with at least k leaves. 

CASE 1: |LEAF(r)\l/(P)| > k. 

We use P and T to construct an out-tree T' of D. This is illustrated in Figure [2] (b). 
To construct T', start with T. For all arcs {u,v) E ^(-P) with v ViT) or v = r, simply add 
{u,v) to the out-tree. For arcs {u,v) E A{P)\A{T) with v E V{T)\{r}, do the 1-change for 
{u, v). Then T' is again an out-tree: for every vertex v E V{T'), an (r', u)-dipath exists in T', 
every vertex except r' has again in-degree 1, and r' has in-degree 0. Also, for every vertex 
V E LEAF(T)\y(P), the out-degree has not changed, so those vertices are still leaves. Thus 
we have an out-tree with at least k leaves, with root r' such that Rd{t') = V{D). This is 
then easily extended to an out-branching with at least k leaves (Proposition [2]). 

CASE 2: The number of type 1 leaves is at least k. 
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P' 
p 

LEAF(r) 

Figure 3: Definitions used in Case 2. Numbers indicate DiST^,. 

The definitions used in this case are illustrated in Figure [3l For every v G Leaf(T), we 
define the following value: if r G Rjj^v), then consider the (f,r)-dipath of D that contains 
the minimum number of LEAF(T)-vertices. Then let DlSTi(i;) denote number of vertices 
in LEAF(r) on this path (including v itself). Note that since we chose P to contain the 
minimum number of LEAF(T)-vertices, we have DiSTj;,(/j) = i. In particular, all vertices li 
receive different values for DiSTx,. 

We now show that for every type 1 vertex li, there is a vertex z G Leaf(T)\1/(P) with 
T)istl{z) = DiSTi(/j). Since li is of type 1, we may consider an (x,y)-dipath P' in D with 
X ~<p li -<p y and no internal vertices in P. By choice of P, P' contains at least one Leaf(T)- 
vertex. Let v be the first LEAF(T)-vertex on P', not equal to x. If DiSTi(?;) < DiSTi(^j), then 
P' can be used to find a path with fewer LEAF(r)-vertices, a contradiction. So P' contains an 
internal vertex v with DiSTi(i)) > DlSTiih)- Now consider the maximum j such that y <p Ij. 
By definition of DiST/,, P' contains a LEAF(T)-vertex w ^ y with Distl(u') < j + l < i. So P' 
also contains an internal vertex w with DiSTx,(t(;) < DiSTx,(/j). Combining this with the fact 
that the DiSTj^-labels decrease by steps of at most one when going along P', it follows that P' 
contains an internal vertex z with DiSTi(z) = DlSTiih)- Internal vertices of P' are not part 
of P, so this proves that there is a vertex z G LEAF(T)\y(P) with Distl(z) = DiSTL(/j), for 
every type 1 vertex li. Since we assumed there are at least k type 1 vertices, and all of them 
receive different labels DiSTx,, this proves that there are at least k vertices in LEAF(r)\y(P), 
so by case 1 above, the desired out-branching exists. 

CASE 3: The number of type 2 leaves is at least k. 

In this case we will use the fact that D contains no useless arcs. The definitions used are 
illustrated in Figured! 

(a): T and P. (b): D, Q, and P' . 




Figure 4: Definitions used in Case 3. 

Let li be a type 2 vertex. Consider the unique (r, /j)-dipath in T. Let (tj, hi) be the last 
arc of this path that is not in A{P). Note that hi = li is possible. Note also that by choice 
of {ti, hi), we have /j+i -<p hi ^p li. Since {ti, hi) is not useless and since we observed in the 
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beginning of this proof that we may assume Roihi) ^ V{D), there is a dipath P' in D that 
starts in r' and ends with the arc (ti, hi) (Proposition [6]). Let Xi be the last vertex on P' with 
Xi -<p hi, and let Zi be the first vertex on P' after with hi Zi. Since r',hi £ V{P'), 
both vertices exist. Let Qi be the subpath of P' from Xi to Zi. So the internal vertices of Qi 
are not part of P, and Xi hi <p Zi. Combining this with /j+i -<p hi :<p li we obtain the 
following useful relations. 

h+i Zi Xi -<p li 

Let Tji be second vertex of Qi. So {xi,yi) £ A{Qi)\A{P), though it is possible that yi G V{P), 
namely when yi = Zi. 

Using these definitions, we can show how to construct an out-branching with at least k 
leaves. Construct T' as follows, starting with P. For every type 2 vertex li, if yi V{P), 
then add {xi,yi). If yi € y{P), then instead do the 1-change for {xi,yi). In order to show 
that this yields again an out-tree, we need to prove that if li and Ij are two different type 2 
vertices, then yi ^ yj. This is done below (Claim 1). Next we need to prove that for every 
type 2 vertex li, a leaf is gained. When yi V{P), this leaf will simply be yi itself. When 
yi S V{P), then the corresponding leaf will be the in-neighbor v of yi with respect to P (so 
{v,yi) € A{P)). To prove that v will indeed be a leaf, we need to show that the other opera- 
tions do not increase its out-degree, hence that v is not equal to xj for some other type 2 vertex 
Ij. This is also proved below (Claim 2). Together this shows that T' is an out-tree with root 
r' with at least k leaves, which is easily extended to the desired out-branching (Proposition [2]). 

Claim 1: For two type 2 vertices li and Ij with i < j, yi ^ yj- 

Suppose yi = yj. Consider the path Qi, and replace the first arc with the arc {xj,yi). This 
gives an (xj, Zi)-dipath with Xj -<p Ij :<p /j+i -<p Zi, which shows Ij is in fact a type 1 vertex, 
a contradiction. 

Claim 2: Ifyi = Zi for some i (so (xj, Zi) G A{D) ), then there exists no type 2 vertex j such 
that {xj,Zi) e A{P). 

To obtain a contradiction, assume that {xj, Zi) E A{P). Note that i ^ j. Since /j+i -<p Zi and 
Xj -<p Ij, it follows that /j+i -<p Ij, so i > j. Using i / j it follows that i > j, and therefore 

k Ij+i- 

We have an arc {xi,Zi) E A{D) with Xi -<p Zi. By choice of P it is not possible that 
Xi -<p Iq -<p Zi for any q, since then a path containing fewer leaves of T could have been 
chosen. But Xi -<p li, so also Zi :<p li. Now we use the assumption that {xj,Zi) E A[P), 
which yields Xj -<p Zi <p li <p /j+i ~<p Zj, so the path Qj shows that li is in fact a type 1 
vertex, a contradiction. □ 

5 Lower bounds for the number of leaves 

The following lemma can be used for instance to find leafy out-branchings in digraphs D with 
minimum in-degree 3 (which is needed to satisfy the third condition). Its proof is postponed 
to the end of this section. 

Lemma 14 Let T he an out-branching of a digraph D, and let P = vq, . . . ,Vp-i be a dipath 
in T where 
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• D contains no arcs {vi,Vj) with i < j, 

• V{P) contains no branch vertices ofT, and 

• every Vi has an in-neighbor in D other than or fj+i- 
Then D has an out-tree with at least p/8 leaves in V{P). 

Lemma [13] is the key ingredient for our main result of this section. Apart from using this 
stronger lemma and a shorter formulation, the proof of the next theorem is essentially the 
same as the one used in [T]. 

Theorem 15 Let D be a digraph on n vertices with at least one out-branching. If D has 
minimum in-degree 3, or if D is an oriented graph with minimum in-degree 2, then i{D) > 

Proof: Let k = \^/n. Consider a 1-optimal out-branching T of D. We only have to 
consider the case that |LEAF(r)| < A; — 1, and thus |BRANCH(r)| < k — 2 (Proposition [T]) . 
Consider the set V of all maximal dipaths in T that contain no branch vertices. Note that 
every non-branch vertex of T is in exactly one such path, so the paths in V give a partition 
of V^(r)\BRANCH(r). Note that every path in V either ends in a leaf of T, or ends in a 
vertex u such that there is a branch vertex v € V{T) with {u,v) G ^(T), and that for every 
branch vertex v there is at most one such u. Hence the number of paths in V is bounded by 
|Leaf(T)| + |BRANCH(r)| <2k-3. 

For every path vq, . . . , Vp-i in V we may apply Lemma [TU since D either has minimum 
in-degree 3 or is an oriented graph with minimum in-degree 2, every Vi has an in- neighbor in 
D other than Vi-i or Vj+i. Since T is 1-optimal, there are no arcs {vi,Vj) in D with i < j 
(Proposition [U Proposition [5]). Hence if one of these paths contains at least 8k vertices, 
the desired out-tree exists (Lemma I14p . So finally suppose every path in V has less that 8A: 
vertices. This yields 

n < 8k{2k - 3) + k - 2 < 16A;^ 

a contradiction with our choice of k. Hence in every case an out-tree with at least \\/n leaves 
can be found. □ 

Combining Theorem 1151 with Proposition [2] and Theorem 1131 respectively, we immediately 
obtain the following bounds for out-branchings. 

Corollary 16 Let D be a digraph on n vertices that has minimum in-degree 3, or has mini- 
mum in-degree 2 and is an oriented graph. 

• If D is strongly connected, then £s{D) > 

• If D contains no useless arcs, then £s{D) ^ i^V^- 

It remains to prove Lemma [Hi For this we will use the following lemma from [6]. 

Lemma 17 Let T be an out-branching of D with root r. Let Q be a dipath in D that starts at 
r. Then making all of the 1-changes for every arc in A{Q)\A{T) yields again an out-branching 
of D that contains Q. 
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Proof of Lemma I14t Let T be an out-branching of a digraph and let P be a dipath in T 
that satisfies the properties stated in the lemma. Let r be the root of T. If Vp-i is not a leaf 
of r, then let Vp be the unique out-neighbor of fp_i in T. In this case, we add the arc (r, Vp) 
to D (if it is not already present), and apply the 1-change for (r, fp) to T. So in both cases, 
from now on we may conveniently assume that Rx{vi) = {fj, . . . In the remainder of 

the proof we will use this to show that D has an out-branching with at least p/4 leaves in 
V{P). Prom this the statement follows; if we added {v^Vp) then removing this arc from the 
out-branching will give two out-trees of the original digraph D, of which at least one has at 
least p/S leaves in V{P). 

If an arc {vi,Vj) is present in D, then i > j. Arcs of this type are called back arcs. (Note 
that this is a subset of the arcs that were called back arcs in Section [T]) 

We will now iteratively make changes to T until every Vi S y{P) is either a leaf, or is the 
tail of a back arc. The property of T being an out-branching will be maintained throughout. 
To assist in the later analysis, tails of back arcs will be colored green or white as soon as these 
back arcs are added (details are given below). 

Changes to T are made in p — 1 stages. During stage i {i G {1, . . . ,p — 1}), the goal is to 
make vertex Vi-i a leaf, if this is still possible. For this we consider a dipath Qi that ends in 
the vertex Vi, and make 1-changes based on this path. The changes we make when considering 
the vertex Vi will only involve arcs that are incident with vertices of P with higher index, and 
vertices not in P. So in stages later than stage i, no changes are made to the arcs incident 
with Vj, for j < i. In particular, Vi-i will remain a leaf if it is made a leaf in stage i. 

Before we define Qi, we observe that the following properties hold for T. These properties 
will be maintained throughout the procedure, and will therefore be called invariant properties. 
Note that the second property follows from the first. 

1. Vi has only out-neighbors in {vi, . . . , t^j+i}, for all i £ {0, . . . ,p — 1}. 

2. Rrivi) C ViP). 

The changes that will be made to T will consist of adding back arcs and adding arcs with tail 
not in P, and removing arcs of the form (vj,Vj+i). Figure [5] (a) shows an example of how the 
out-branching may look after five stages (only the vertices of P are shown). Note that the 
invariant still holds even though the set of reachable vertices may change for a vertex Vi. 

The operation of stage i, and the dipath Qi that we use for it is defined as follows. Let Tj 
denote the out-branching as it is in the beginning of stage i, so Ti = T. The changes in stage i 
will yield a new out-branching Tj+i. In Figure [5] an example is shown where Ty is constructed 
from Tg. The dashed arcs in FigureO (b) show the dipath Qq. If Vi-i is already a leaf or a tail 
of a back arc in Tj, we do nothing, so Ti^i = Ti. Otherwise, Vi is the only out-neighbor of Vi-i 
in Ti (invariant Property 1). Then we consider a dipath Qi = x, v^r^q), v^r^q-i), . . . , in D 
that ends in Vi, and has x RT^{vi), constructed as follows. Let a{l) = i. By our assumption, 
Vi has an in-neighbor u in D that is not equal to or Vi+i. Since all arcs between vertices 
in V{P) are back arcs and Rxi{vi) C V{P) (invariant Property 2), this vertex u is either 
not in RT.{vi) or it is equal to Vj for some j > i + 2. In the first case, Qi = u,Vi. In the 
second case let a{2) = j, and continue constructing the path using the same rule: Vj has an 
in-neighbor that either is not in i?^. (vj), or is equal to vi for some I > j + 2, etc. 

This process will terminate with a dipath Qi = x,v„(^q~^, . . . ,'L'o-(i), where ^^-(i) = fi, the 
function a increases in steps of at least 2, and x RT.{vi). (Note that x may or may not be 
in V{P).) It follows that if we make 1-changes for all arcs in Qi, again an out-branching is 
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Figure 5: Stage 6: constructing T-j from Tg. 



obtained (Lemma 1171 note that we can easily extend Qi to start in r), and becomes a 
leaf. Observe also that the invariant properties are maintained by these changes. This yields 

In addition we assign the following colors to vertices. All vertices of Ti start out being 
black. In Tj+i we color the vertices as follows: 

• The first vertex x of Qi is colored white. 

• The internal vertices ^^-(j) of Qi {j G {2,...,q}) are colored green, unless they were 
already white in Tj. 

• In all other cases vertices receive the same color as they have in Tj. 

We say a vertex becomes green (white) in stage i if in Tj+i it is green (white) but in Ti it is 
colored differently. We observe that the following green vertex properties hold: 

1. If Vj is green in Ti, then {vj-i,Vj) A{Ti). 

2. If a vertex Vj becomes green in stage i, then Ti contains the dipath Vi, . . . , Vj. 

After p — 1 stages, this procedure terminates with the out-branching Tp. In order to give 
a lower bound for the number of leaves in Tp, we map all vertices that are tails of back arcs 
(green and white vertices) to leaves. This mapping is as follows: 

• A white vertex of Tp that became white in stage i is mapped to the leaf Vi^i (note that 
this is still a leaf in Tp). 

• A green vertex vj of Tp is mapped to 

— fj-i if it is a leaf, 

— to the leaf Vi-i if vj^i first became white in stage i, and 

— to the leaf if is green and became green during stage i. 
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Finally, we show that every leaf has at most 3 preimages in this mapping. Consider a leaf 
Vi-i. If Vi-i was already a leaf in Tj (no changes are made in stage i), then only the vertex 
Vi may be mapped to Vj-i, if fj is still green in Tp. 

On the other hand, if Vi-i is made a leaf during stage i, then at the beginning of stage 
i, its unique out-neighbor is Vi, which therefore is not green (green vertex Property 1) and 
neither will be colored green during later stages. However, the single vertex vj that becomes 
white in stage i is mapped to fj-i. In addition, Vj^i may be green in Tp, in which case this 
vertex is also mapped to fj-i. Considering the above assignment rules, the only other vertices 
that may be mapped to Vi^i are green vertices Vj such that vj-i became green during stage 
i. We now argue that there is at most one such vertex. The vertex vj did not become green 
during a stage s for s > i, since in stage i the arc {vj^2,Vj-i) is removed, and Vj only becomes 
green in stage s if fs, f^+i, . . . ,Vj is a dipath in T during stage s (green vertex Property 2). 
On the other hand, if Vj became green during an earlier stage, then during stage i the arc 
{vj-i,Vj) is not present anymore (green vertex Property 1). This means that Vj-i must be 
the second vertex of the path Qi, since the in- neighbor of Vj-i that is added to Qi is not in 
Rxiivi), so the construction of Qi ends after one more step. Hence there can only be one such 
vertex. 

This concludes the proof that every leaf has at most 3 preimages in our mapping. In 
addition, in Tp every vertex of V{P) is either a leaf, a green vertex, or a white vertex (note 
that Vp-i starts out as a leaf). All white vertices and green vertices are mapped to leaves. 
It follows that at least p/4 vertices of P end up being leaves. Together with the observation 
made in the beginning of the proof, the statement follows. □ 

6 Discussion 

In Section [4. II we showed that for digraphs D without useless arcs, 1{D) / ^s{D) ^ 3 holds. In 
Section [T] we gave a simple example where i{D)/is{D) = 2. This leaves the question what the 
worst possible ratio may be. More complex examples exist where i{D)/is{D) = 2.5. Figure [6] 
shows how to construct such a digraph D consisting of k digraphs on six vertices, two extra 
vertices r and r', and arcs between these. This graph has an out-tree T with 5k leaves (with 
root r), but it can be verified that is{D) = 2/i; (any out-branching needs to have r' as root). 
For clarity, some arcs of T are drawn as half arcs; these arcs should be understood as having 
r as tail. Observe that no arc in D is useless. 



We believe that this is the worst possible ratio. However bridging the gap between the 
factors 3 and 2.5 may require a long proof and may not be worth the effort. 

Similarly, we do not believe that the factor \ from Corollary [16] is tight. We do not know 
what the best possible factor could be, but we do have examples showing that the analysis 
from Lemma [HI is tight (up to a small additive term); to get a better factor, the construction 



o 



* : A{T) 

> : A{D)\A(T) 
: LEAF(r) 




Figure 6: A digraph D with e{D)/£s{D) = 2.5. 
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of the out-branching would need to be changed. Again we do not expect it to be worth the 
effort to improve the factors here. 

It seems that in order to significantly improve the parameter function of FPT algorithms 
for these problems further, a different approach is needed, one that is not based on dynamic 
programming over a tree decomposition. Improving the factor from Theorem [T3] from 3 to 
2.5 would for instance slightly improve the constant that is suppressed by the O-notation 
in the expression 

20(fciogfc)^ but we do not consider this a significant improvement. It is an 
interesting question whether different, significantly faster FPT algorithms are possible for 
these two problems, for instance FPT algorithms with a parameter function of the form 
for some constant c. Such algorithms exist for the undirected version (with c = 6.75, see [7j). 
This was also asked in |15j . 
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